PyTorch Seq2Seq Translator From Scratch: Attention & RNN Part 1